6 research outputs found

Surgical Data Science - from Concepts toward Clinical Translation

Author: Cleary K
Collins J
Collins T
Eisenmann M
Fallert J
Feussner H
Fichtinger G
Forestier G
Giannarou S
Gibaud B
Gockel I
Goedeke J
Grantcharov T
Hager GD
Hashimoto DA
Hashizume M
Heckmann-Nötzel D
Jannin P
Joyeux L
Kenngott HG
Kikinis R
Lam K
Leff DR
Madani A
Maier-Hein L
Malpani A
Marcus HJ
Mascagni P
Meireles O
März K
Müller-Stich BP
Mündermann L
Nakawala H
Navab N
Neumuth T
Onogur S
Padoy N
Park A
Pugh C
Roß T
Sarikaya D
Seitel A
Speidel S
Stoyanov D
Sznitman R
Taylor RH
Teber D
Tizabi MD
Vedula SS
Wagner M
Ückert F
Publication venue: 'Elsevier BV'
Publication date: 18/11/2021

Recent developments in data science in general and machine learning in particular have transformed the way experts envision the future of surgery. Surgical Data Science (SDS) is a new research field that aims to improve the quality of interventional healthcare through the capture, organization, analysis and modeling of data. While an increasing number of data-driven approaches and clinical applications have been studied in the fields of radiological and clinical data science, translational success stories are still lacking in surgery. In this publication, we shed light on the underlying reasons and provide a roadmap for future advances in the field. Based on an international workshop involving leading researchers in the field of SDS, we review current practice, key achievements and initiatives as well as available standards and tools for a number of topics relevant to the field, namely (1) infrastructure for data acquisition, storage and access in the presence of regulatory constraints, (2) data annotation and sharing and (3) data analytics. We further complement this technical perspective with (4) a review of currently available SDS products and the translational progress from academia and (5) a roadmap for faster clinical translation and exploitation of the full potential of SDS, based on an international multi-round Delphi process

UCL Discovery

Common Limitations of Image Processing Metrics: A Picture Story

Author: Acion Laura
Antonelli Michela
Arbel Tal
Bakas Spyridon
Bankhead Peter
Baumgartner Michael
Benis Arriel
Cardoso M. Jorge
Cheplygina Veronika
Christodoulou Evangelia
Cimini Beth
Collins Gary S.
Eisenmann Matthias
Farahani Keyvan
Glocker Ben
Godau Patrick
Gutierrez Clarisa Sanchez
Hamprecht Fred
Hashimoto Daniel A.
Heckmann-Nötzel Doreen
Hoffman Michael M.
Huisman Merel
Isensee Fabian
Jannin Pierre
Jäger Paul
Kahn Charles E.
Kainz Bernhard
Karargyris Alexandros
Karthikesalingam Alan
Kavur Emre
Kenngott Hannes
Kleesiek Jens
Kooi Thijs
Kopp-Schneider Annette
Kozubek Michal
Kreshuk Anna
Kurc Tahsin
Landman Bennett A.
Litjens Geert
Madani Amin
Maier-Hein Klaus
Maier-Hein Lena
Martel Anne L.
Mattson Peter
Meijering Erik
Menze Bjoern
Moher David
Moons Karel G. M.
Müller Henning
Nichyporuk Brennan
Nickel Felix
Noyan M. Alican
Petersen Jens
Polat Gorkem
Rajpoot Nasir
Reinke Annika
Reyes Mauricio
Riegler Michael
Rieke Nicola
Rivaz Hassan
Rädsch Tim
Saez-Rodriguez Julio
Saha Anindo
Schroeter Julien
Shetty Shravya
Stieltjes Bram
Sudre Carole H.
Summers Ronald M.
Taha Abdel A.
Tizabi Minu D.
Tsaftaris Sotirios A.
Van Calster Ben
van Ginneken Bram
van Smeden Maarten
Varoquaux Gaël
Wiesenfarth Manuel
Yaniv Ziv R.
Publication date: 01/01/2021

While the importance of automatic image analysis is continuously increasing, recent meta-research revealed major flaws with respect to algorithm validation. Performance metrics are particularly key for meaningful, objective, and transparent performance assessment and validation of the used automatic algorithms, but relatively little attention has been given to the practical pitfalls when using specific metrics for a given image analysis task. These are typically related to (1) the disregard of inherent metric properties, such as the behaviour in the presence of class imbalance or small target structures, (2) the disregard of inherent data set properties, such as the non-independence of the test cases, and (3) the disregard of the actual biomedical domain interest that the metrics should reflect. This dynamically updated document has the purpose to illustrate important limitations of performance metrics commonly applied in the field of image analysis. In this context, it focuses on biomedical image analysis problems that can be phrased as image-level classification, semantic segmentation, instance segmentation, or object detection tasks. The current version is based on a Delphi process on metrics conducted by an international consortium of image analysis experts from more than 60 institutions worldwide. Comment: This is a dynamic paper on limitations of commonly used metrics. The current version discusses metrics for image-level classification, semantic segmentation, object detection and instance segmentation. For missing use cases, comments or questions, please contact [email protected] or [email protected]. Substantial contributions to this document will be acknowledged with a co-authorship.
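The two metric properties named in the abstract above, behaviour under class imbalance and with small target structures, can be illustrated with a toy example. The following is a minimal sketch and is not taken from the paper: the helper names (`accuracy`, `balanced_accuracy`, `dice`) and all data are assumptions made up for illustration, and balanced accuracy is used only as one possible contrast to plain accuracy, not as a recommendation from the authors.

```python
# Toy illustration (hypothetical data, not from the paper) of two metric pitfalls.

def accuracy(y_true, y_pred):
    """Fraction of labels predicted correctly."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def balanced_accuracy(y_true, y_pred):
    """Mean of the per-class recalls."""
    recalls = []
    for cls in sorted(set(y_true)):
        idx = [i for i, t in enumerate(y_true) if t == cls]
        recalls.append(sum(y_pred[i] == cls for i in idx) / len(idx))
    return sum(recalls) / len(recalls)


def dice(pred, ref):
    """Dice similarity coefficient between two sets of pixel coordinates."""
    pred, ref = set(pred), set(ref)
    if not pred and not ref:
        return 1.0  # convention chosen here for two empty masks
    return 2 * len(pred & ref) / (len(pred) + len(ref))


# Pitfall 1: class imbalance. A classifier that always predicts the
# majority class still reaches a high accuracy.
y_true = [0] * 95 + [1] * 5
y_pred = [0] * 100
acc = accuracy(y_true, y_pred)
bal_acc = balanced_accuracy(y_true, y_pred)

# Pitfall 2: small target structures. The same one-pixel shift costs a
# 2-pixel structure far more Dice than a 100-pixel structure.
small_ref = {(0, 0), (0, 1)}
small_pred = {(0, 1), (0, 2)}
large_ref = {(0, c) for c in range(100)}
large_pred = {(0, c) for c in range(1, 101)}
dice_small = dice(small_pred, small_ref)
dice_large = dice(large_pred, large_ref)

print(f"accuracy={acc:.2f}, balanced accuracy={bal_acc:.2f}")
print(f"Dice small={dice_small:.2f}, Dice large={dice_large:.2f}")
```

In this toy setup the always-negative classifier scores 0.95 accuracy but only 0.50 balanced accuracy, and the identical one-pixel shift gives a Dice of 0.50 for the small structure against 0.99 for the large one.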

arXiv.org e-Print Archive

Edinburgh Research Explorer

Understanding metric-related pitfalls in image analysis validation

Author: Acion Laura
Antonelli Michela
Arbel Tal
Bakas Spyridon
Baumgartner Michael
Benis Arriel
Blaschko Matthew
Büttner Florian
Calster Ben Van
Cardoso M. Jorge
Chen Jianxu
Cheplygina Veronika
Christodoulou Evangelia
Cimini Beth A.
Collins Gary S.
Eisenmann Matthias
Farahani Keyvan
Ferrer Luciana
Galdran Adrian
Ginneken Bram van
Glocker Ben
Godau Patrick
Haase Robert
Hashimoto Daniel A.
Heckmann-Nötzel Doreen
Hoffman Michael M.
Huisman Merel
Isensee Fabian
Jannin Pierre
Jäger Paul F.
Kahn Charles E.
Kainmueller Dagmar
Kainz Bernhard
Karargyris Alexandros
Karthikesalingam Alan
Kavur A. Emre
Kenngott Hannes
Kleesiek Jens
Kofler Florian
Kooi Thijs
Kopp-Schneider Annette
Kozubek Michal
Kreshuk Anna
Kurc Tahsin
Landman Bennett A.
Litjens Geert
Madani Amin
Maier-Hein Klaus
Maier-Hein Lena
Martel Anne L.
Mattson Peter
Meijering Erik
Menze Bjoern
Moons Karel G. M.
Müller Henning
Nichyporuk Brennan
Nickel Felix
Petersen Jens
Rafelski Susanne M.
Rajpoot Nasir
Reinke Annika
Reyes Mauricio
Riegler Michael A.
Rieke Nicola
Rädsch Tim
Saez-Rodriguez Julio
Shetty Shravya
Smeden Maarten van
Sudre Carole H.
Summers Ronald M.
Sánchez Clara I.
Taha Abdel A.
Tiulpin Aleksei
Tizabi Minu D.
Tsaftaris Sotirios A.
Varoquaux Gaël
Wiesenfarth Manuel
Yaniv Ziv R.
Publication date: 01/01/2023

Validation metrics are key for the reliable tracking of scientific progress and for bridging the current chasm between artificial intelligence (AI) research and its translation into practice. However, increasing evidence shows that particularly in image analysis, metrics are often chosen inadequately in relation to the underlying research problem. This could be attributed to a lack of accessibility of metric-related knowledge: While taking into account the individual strengths, weaknesses, and limitations of validation metrics is a critical prerequisite to making educated choices, the relevant knowledge is currently scattered and poorly accessible to individual researchers. Based on a multi-stage Delphi process conducted by a multidisciplinary expert consortium as well as extensive community feedback, the present work provides the first reliable and comprehensive common point of access to information on pitfalls related to validation metrics in image analysis. Focusing on biomedical image analysis but with the potential of transfer to other fields, the addressed pitfalls generalize across application domains and are categorized according to a newly created, domain-agnostic taxonomy. To facilitate comprehension, illustrations and specific examples accompany each pitfall. As a structured body of information accessible to researchers of all levels of expertise, this work enhances global comprehension of a key topic in image analysis validation. Comment: Shared first authors: Annika Reinke, Minu D. Tizabi; shared senior authors: Paul F. Jäger, Lena Maier-Hein

arXiv.org e-Print Archive

Edinburgh Research Explorer

HAL-Rennes 1

Common limitations of image processing metrics: a picture story

Author: Antonelli M
Arbel T
Bakas S
Cardoso MJ
Cheplygina V
Eisenmann M
Farahani K
Ginneken BV
Glocker B
Heckmann-Nötzel D
Isensee F
Jannin P
Jäger P
Kahn CE
Kleesiek J
Kopp-Schneider A
Kozubek M
Kurc T
Landman BA
Litjens G
Maier-Hein K
Maier-Hein L
Menze B
Müller H
Petersen J
Reinke A
Reyes M
Rieke N
Rädsch T
Stieltjes B
Sudre CH
Summers RM
Tizabi MD
Tsaftaris SA
Publication venue: 'Center for Open Science'
Publication date: 13/04/2021

While the importance of automatic image analysis is increasing at an enormous pace, recent meta-research revealed major flaws with respect to algorithm validation. Specifically, performance metrics are key for objective, transparent and comparative performance assessment, but relatively little attention has been given to the practical pitfalls when using specific metrics for a given image analysis task. A common mission of several international initiatives is therefore to provide researchers with guidelines and tools to choose the performance metrics in a problem-aware manner. This dynamically updated document has the purpose to illustrate important limitations of performance metrics commonly applied in the field of image analysis. The current version is based on a Delphi process on metrics conducted by an international consortium of image analysis experts

Spiral - Imperial College Digital Repository

Surgical data science – from concepts toward clinical translation

Author: Cleary Kevin
Collins Justin
Collins Toby
Eisenmann Matthias
Fallert Johannes
Feussner Hubertus
Fichtinger Gabor
Forestier Germain
Giannarou Stamatia
Gibaud Bernard
Gockel Ines
Goedeke Jan
Grantcharov Teodor
Hager Gregory D.
Hashimoto Daniel A.
Hashizume Makoto
Heckmann-Nötzel Doreen
Jannin Pierre
Joyeux Luc
Kenngott Hannes G.
Kikinis Ron
Lam Kyle
Leff Daniel R.
Madani Amin
Maier-Hein Lena
Malpani Anand
Marcus Hani J.
Mascagni Pietro
Meireles Ozanan
März Keno
Müller-Stich Beat P.
Mündermann Lars
Nakawala Hirenkumar
Navab Nassir
Neumuth Thomas
Onogur Sinan
Padoy Nicolas
Park Adrian
Pugh Carla
Roß Tobias
Sarikaya Duygu
Seitel Alexander
Speidel Stefanie
Stoyanov Danail
Sznitman Raphael
Taylor Russell H.
Teber Dogu
Tizabi Minu D.
Vedula Swaroop S.
Wagner Martin
Ückert Frank
Publication venue: 'Elsevier BV'
Publication date: 18/11/2021

Recent developments in data science in general and machine learning in particular have transformed the way experts envision the future of surgery. Surgical Data Science (SDS) is a new research field that aims to improve the quality of interventional healthcare through the capture, organization, analysis and modeling of data. While an increasing number of data-driven approaches and clinical applications have been studied in the fields of radiological and clinical data science, translational success stories are still lacking in surgery. In this publication, we shed light on the underlying reasons and provide a roadmap for future advances in the field. Based on an international workshop involving leading researchers in the field of SDS, we review current practice, key achievements and initiatives as well as available standards and tools for a number of topics relevant to the field, namely (1) infrastructure for data acquisition, storage and access in the presence of regulatory constraints, (2) data annotation and sharing and (3) data analytics. We further complement this technical perspective with (4) a review of currently available SDS products and the translational progress from academia and (5) a roadmap for faster clinical translation and exploitation of the full potential of SDS, based on an international multi-round Delphi process

INRIA a CCSD electronic archive server

Metrics reloaded: Pitfalls and recommendations for image analysis validation

Author: Acion Laura
Antonelli Michela
Arbel Tal
Bakas Spyridon
Bankhead Peter
Baumgartner Michael
Benis Arriel
Cardoso M. Jorge
Cheplygina Veronika
Christodoulou Evangelia
Cimini Beth
Collins Gary S.
Eisenmann Matthias
Farahani Keyvan
Glocker Ben
Godau Patrick
Gutiérrez Clarisa Sánchez
Hashimoto Daniel A.
Heckmann-Nötzel Doreen
Hoffman Michael M.
Huisman Merel
Isensee Fabian
Jannin Pierre
Jäger Paul F.
Kahn Charles E.
Karargyris Alexandros
Karthikesalingam Alan
Kavur A. Emre
Kenngott Hannes
Kleesiek Jens
Kopp-Schneider Annette
Kozubek Michal
Kreshuk Anna
Kurc Tahsin
Landman Bennett A.
Litjens Geert
Madani Amin
Maier-Hein Klaus
Maier-Hein Lena
Martel Anne L.
Mattson Peter
Meijering Erik
Menze Bjoern
Moher David
Moons Karel G. M.
Müller Henning
Nichyporuk Brennan
Nickel Felix
Petersen Jens
Rajpoot Nasir
Reinke Annika
Reyes Mauricio
Riegler Michael A.
Rieke Nicola
Rädsch Tim
Saez-Rodriguez Julio
Shetty Shravya
Sudre Carole H.
Summers Ronald M.
Taha Abdel A.
Tizabi Minu D.
Tsaftaris Sotirios A.
Van Calster Ben
van Ginneken Bram
van Smeden Maarten
Varoquaux Gaël
Wiesenfarth Manuel
Publication date: 07/07/2022

The field of automatic biomedical image analysis crucially depends on robust and meaningful performance metrics for algorithm validation. Current metric usage, however, is often ill-informed and does not reflect the underlying domain interest. Here, we present a comprehensive framework that guides researchers towards choosing performance metrics in a problem-aware manner. Specifically, we focus on biomedical image analysis problems that can be interpreted as a classification task at image, object or pixel level. The framework first compiles domain interest-, target structure-, data set- and algorithm output-related properties of a given problem into a problem fingerprint, while also mapping it to the appropriate problem category, namely image-level classification, semantic segmentation, instance segmentation, or object detection. It then guides users through the process of selecting and applying a set of appropriate validation metrics while making them aware of potential pitfalls related to individual choices. In this paper, we describe the current status of the Metrics Reloaded recommendation framework, with the goal of obtaining constructive feedback from the image analysis community. The current version has been developed within an international consortium of more than 60 image analysis experts and will be made openly available as a user-friendly toolkit after community-driven optimization. Comment: Shared first authors: Lena Maier-Hein, Annika Reinke. arXiv admin note: substantial text overlap with arXiv:2104.0564

arXiv.org e-Print Archive